Method and device for transmitting metadata in WFD

ABSTRACT

A method and a device for transmitting metadata in a WFD are provided. Particularly, a WFD source forms a WFD frame including a WFD header and metadata. The WFD source transmits, to a WFD sink, a packet stream created by packetizing a metadata stream including the WFD frame. The WFD header includes a metadata structure field and a metadata update indicator. The metadata structure field indicates whether the metadata has a video metadata structure or an audio metadata structure. The metadata update indicator indicates that previous metadata is updated such that all of the previous metadata is replaced with new metadata, or that the previous metadata is incrementally updated together with updated metadata.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is the National Stage filing under 35 U.S.C. 371 of International Application No. PCT/KR2016/008188, filed on Jul. 27, 2016, which claims the benefit of U.S. Provisional Applications No. 62/197,109 filed on Jul. 27, 2015, No. 62/200,106 filed on Aug. 2, 2015, No. 62/200,126 filed on Aug. 3, 2015, No. 62/201,115 filed on Aug. 5, 2015, No. 62/201,592 filed on Aug. 6, 2015, No. 62/201,580 filed on Aug. 6, 2015, No. 62/202,150 filed on Aug. 6, 2015, No. 62/201,581 filed on Aug. 6, 2015, No. 62/203,378 filed on Aug. 10, 2015, No. 62/203,415 filed on Aug. 11, 2015, No. 62/238,107 filed on Oct. 6, 2015, and No. 62/238,106 filed on Oct. 6, 2015, the contents of which are all hereby incorporated by reference herein in their entirety.

BACKGROUND OF THE INVENTION Field of the Invention

The present invention relates to wireless communication and more particularly, a method and device for transmitting metadata in WFD (WiFi (Wireless Fidelity) Display).

Related Art

Although the performance of mobile devices has evolved remarkably to be considered as the equivalent of personal computers (PCs), the mobile device still has its limitations in screen size. Most particularly, since portability is one of the most important features of the mobile device, it marginal line for screen size is said to be equal to 6 inches. However, for users who enjoy viewing multi-media content, a 6-inch display may still be considered to be a small screen.

Accordingly, extensive research is being carried out for developing a technology enabling users to watch images (or video) that were viewed in the mobile device through large-sized television (TV) screens or monitor screens. This technology may also be expressed by using the term wireless display transmission technology. The wireless display transmission technology may be broadly divided into content transmission and mirroring (screen casting). Instead of directly transmitting the display screen of the mobile device as it is, content transmission should be associated with Video on Demand (VOD) services. More specifically, content transmission is a method of sending images (or video) via signal, and mirroring is a method of sending a content filed to a remote device via streaming in order to show (or display) the same content once again through a large-sized screen, such as a TV screen.

Just as implied in the term itself, mirroring (screen casting) corresponds to a method of simultaneously showing a screen that is displayed on a mobile device as though it is reflected on a mirror. This is similar to a method of projecting a computer screen through a projector, when performing a presentation by establishing a wired connection, such as D-Subminiature (D-sub) (RGB), Digital Visual Interface (DVI), High-Definition Multimedia Interface (HDMI). The mirroring method is advantageous in that it can wirelessly transmit pixel information of an original screen without modification in real-time without being dependent on a specific service.

WiFi Miracast is being researched as a wireless display transmission technology using Wi-Fi. Miracast is a wireless image (or video) transmission standard as well as a wireless display transmission technology. Miracast is a type of mirroring (screen casting) technology, wherein the display screen and sound are compressed and outputted to a wireless LAN and then the compressed display screen and sound are decompressed in a dongle or an all-in-one receiver, thereby being displayed on the screen.

SUMMARY OF THE INVENTION

The present invention provides a method for transmitting metadata in WFD and a device using the method.

The present invention proposes a method for transmitting metadata in WFD.

First, to clarify terms, since WFD R2 may also be expressed as WFD2, this document uses the terms of WFD R2 and WFD2 interchangeably. Also, since embodiments according to the present invention are described based on the Miracast R2, WFD is used as a concept including the WFD R2. Accordingly, a WFD source is employed as a concept including a WFD R2 source while a WFD sink is employed as a concept including a WFD R2 sink.

First, a WFD source constitutes a WFD frame which includes a WFD header and metadata.

A WFD source transmits a packet stream, which is a stream created by packetizing a metadata stream including the WFD frame, to a WFD sink. A packet stream may correspond to a PES or PES packets.

The WFD header includes a metadata structure field and a metadata update indicator. The metadata structure field may correspond to wfd2_data_structure_type, and the metadata update indicator may correspond to wfd2_data_update_indicator.

The metadata structure field indicates whether metadata is configured by a video metadata structure or an audio metadata structure. In other words, metadata structure may be distinguished by the wfd2_data_structure_type value.

The metadata update indicator indicates whether a first update or a second update is performed on the metadata. In other words, an update method may be chosen according to the value set to the wfd2_data_structure_type. The first update may correspond to a full update in which all of the previous metadata is replaced with new metadata. The second update may correspond to an incremental update in which updated metadata is used to update previous metadata incrementally. In other words, different from the first update, if the second update is performed, only the updated metadata may be included in the WFD frame. At this time, metadata may correspond to metadata_record.

Also, a metadata stream is determined by a metadata descriptor. A metadata descriptor includes a metadata application format configured by a predetermined first value and a metadata format configured by a predetermined second value. According to an embodiment of the present invention, the value of the metadata application format may be set to 0xFF10 corresponding to the predetermined first value while the value of the metadata format may be set to 0xC8 corresponding to the predetermined second value.

Also, metadata includes information about video contents and information about audio contents. Information about video contents may include protection information indicating whether video contents are protected and URL information of the video contents.

Also, the information about video contents and the information about audio contents may be formatted by the US-ASCII string and UTF-8 string.

Also, metadata may include information about a text format and/or information about graphic data. If metadata includes information about a text format and/or information about graphic data, the WFD R2 header of the WFD R2 frame, which is a frame for transmitting the metadata, may be formatted by wfd2_tag, wfd2_data_structure_tag, Country code, and wfd2_subtitle_infor_length.

Also, a metadata stream may be multiplexed with an au audio stream or a video stream and transmitted as a transport stream. In the WFD R2, an auxiliary stream and/or metadata stream may be may be multiplexed together with an audio stream and/or video stream in the MPEG2-TS and transmitted as a transport stream. At this time, each stream may be packetized through a PES packetizer and generated for each of a plurality of PES. In other words, a video stream generates a PES with respect to a video stream through a video PES packetizer, an audio stream generates a PES with respect to an audio stream through an audio PES packetizer, an auxiliary stream generates a PES with respect to an auxiliary stream through an auxiliary PES packetizer, and a metadata stream generates a PES with respect to a metadata stream through a metadata PES packetizer.

Through the method described above, a WFD source may transmit metadata to a WFD sink.

The present invention proposes a device for transmitting metadata through WFD. The device may correspond to a WFD source, but the present invention is not limited to the aforementioned case.

First, to clarify terms, since WFD R2 may also be expressed as WFD2, this document uses the terms of WFD R2 and WFD2 interchangeably. Also, since embodiments according to the present invention are described based on the Miracast R2, WFD is used as a concept including the WFD R2. Accordingly, a WFD source is employed as a concept including a WFD R2 source while a WFD sink is employed as a concept including a WFD R2 sink.

A WFD device includes an RF unit transmitting and receiving a radio signal and a processor coupled operatively to the RF unit.

The processor configures a WFD frame including a WFD header and metadata. Also, the processor transmits a packet stream, which is a stream created by packetizing a metadata stream including the WFD frame, to a WFD sink. A packet stream may correspond to a PES or PES packets.

The WFD header includes a metadata structure field and a metadata update indicator. The metadata structure field may correspond to wfd2_data_structure_type, and the metadata update indicator may correspond to wfd2_data_update_indicator.

The metadata structure field indicates whether metadata is configured by a video metadata structure or an audio metadata structure. In other words, metadata structure may be distinguished by the wfd2_data_structure_type value.

The metadata update indicator indicates whether a first update or a second update is performed on the metadata. In other words, an update method may be chosen according to the value set to the wfd2_data_structure_type. The first update may correspond to a full update in which all of the previous metadata is replaced with new metadata. The second update may correspond to an incremental update in which updated metadata is used to update previous metadata incrementally. In other words, different from the first update, if the second update is performed, only the updated metadata may be included in the WFD frame. At this time, metadata may correspond to metadata_record.

Also, a metadata stream is determined by a metadata descriptor. A metadata descriptor includes a metadata application format configured by a predetermined first value and a metadata format configured by a predetermined second value. According to an embodiment of the present invention, the value of the metadata application format may be set to 0xFF10 corresponding to the predetermined first value while the value of the metadata format may be set to 0xC8 corresponding to the predetermined second value.

Also, metadata includes information about video contents and information about audio contents. Information about video contents may include protection information indicating whether video contents are protected and URL information of the video contents.

Also, the information about video contents and the information about audio contents may be formatted by the US-ASCII string and UTF-8 string.

Also, metadata may include information about a text format and/or information about graphic data. If metadata includes information about a text format and/or information about graphic data, the WFD R2 header of the WFD R2 frame, which is a frame for transmitting the metadata, may be formatted by wfd2_tag, wfd2_data_structure_tag, Country code, and wfd2_subtitle_infor_length.

Also, a metadata stream may be transmitted as a transport stream after being multiplexed with an audio stream or a video stream. In the WFD R2, an auxiliary stream and/or metadata stream may be multiplexed together with an audio stream and/or video stream in the MPEG2-TS and transmitted as a transport stream. At this time, each stream may be packetized through a PES packetizer and generated for each of a plurality of PES. In other words, a video stream generates a PES with respect to a video stream through a video PES packetizer, an audio stream generates a PES with respect to an audio stream through an audio PES packetizer, an auxiliary stream generates a PES with respect to an auxiliary stream through an auxiliary PES packetizer, and a metadata stream generates a PES with respect to a metadata stream through a metadata PES packetizer.

In a wireless communication system supporting the WFD R2, metadata is transmitted from a WFD source to a WFD sink, and therefore, useful information may be delivered to the user in an efficient manner. Also, various types of metadata may be configured, and therefore, various kinds of information about contents may be provided for the user.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a conceptual view illustrating Wi-Fi Direct Service (WFDS) configuration elements (or components).

FIG. 2 is a conceptual view illustrating a WFD network.

FIG. 3 is a conceptual view illustrating a WFD session.

FIG. 4 is a conceptual view illustrating a WFD session configuration method.

FIG. 5 is a conceptual view illustrating a network between a WFD source and a WFD sink.

FIG. 6 is a conceptual view illustrating WFD capability exchange and negotiation procedures.

FIG. 7 is a conceptual view illustrating a WFD session establishment procedure.

FIG. 8 is a conceptual view illustrating a method for transmitting and receiving a plurality of streams based on MPEG-TS according to an exemplary embodiment of the present invention.

FIG. 9 is a flow diagram of including information useful for a user of the Miracast R2 into metadata and delivering the metadata through the RTSP message transmission scheme according to an embodiment of the present invention.

FIG. 10 illustrates a method for packetizing a plurality of streams based on the MPEG2-TS and transmitting the packetized streams according to an embodiment of the present invention.

FIG. 11 illustrates a structure of a PES packet according to an embodiment of the present invention.

FIG. 12 is a flow diagram illustrating a procedure of transmitting metadata in the WFD R2 according to an embodiment of the present invention.

FIG. 13 is a block view illustrating a wireless device to which the exemplary embodiment of the present invention can be applied.

DESCRIPTION OF EXEMPLARY EMBODIMENTS

In the conventional wireless LAN system, operations between devices (access point (AP) and station (STA)) within an infrastructure basic service set (BSS), wherein the access point (AP) functions as a hub, were mostly defined. The AP may perform a function of supporting a physical layer for a wireless/wired connection, a routing function for devices within the network, a function of adding/removing devices to/from the network, a function of providing services, and so on. More specifically, in the conventional wireless LAN system, devices within the network are connected through the AP and not connected to one another by direct connection.

As a technology for supporting direct connection between devices, a Wi-Fi Direct standard is being defined. Wi-Fi Direct is a direct communication technology allowing devices (or stations (STAs)) to be easily connected to one another without an access point, which is essentially required in the conventional wireless LAN system. In case Wi-Fi Direct is being used, connection between the devices is configured without any complicated configuration (or set-up) procedures. Thus, diverse services may be provided to the user.

In the Wi-Fi Alliance (WFA), a Wi-Fi Direct service (WFDS), which supports diverse services using a Wi-Fi Direct link (e.g., Send, Play, Display, Print, and so on), is being researched. According to the WFDS, an application may be controlled or managed by a service platform, which is referred to as an Application Service Platform (ASP).

A WFDS device supporting a WFDS includes devices supporting a wireless LAN system, such as display devices, printers, digital cameras, projectors, smart phones, and so on. Also, the WFDS device may include an STA and an AP. WI-DS devices within a WFDS network may be directly connected to one another.

FIG. 1 is a conceptual view illustrating Wi-Fi Direct Service (WI-DS) configuration elements (or components).

Referring to FIG. 1, a WFDS framework may include a Wi-Fi Direct layer 100, an ASP 120, a service layer 140, and an application layer 160.

The Wi-Fi Direct layer 100 corresponds to a medium access control (MAC) layer, which is defined in the Wi-Fi Direct standard. A wireless connection may be configured by a physical layer (not shown), which is backward compatible with a Wi-Fi PHY, below the Wi-Fi Direct layer 100. An Application Service Platform (ASP) 120 is defined above the Wi-Fi Direct layer 100.

The ASP 120 corresponds to a common shared platform, and the ASP 120 performs the functions of session management, service command processing, and control and security between ASPs between the Application layer 160, which is a higher layer of the ASP 120, and the Wi-Fi Direct layer 100, which is a lower layer of the ASP 120.

The Service layer 140 is defined above the ASP 120. For example, the Service layer 140 may support 4 basic services, which correspond to Send, Play, Display, and Print services, and a service that is defined in a third-party application. Moreover, the Service layer 140 may also support Wi-Fi Serial Bus (WSB), Wi-Fi Docking, or Neighbor Awareness Networking (NAN).

The application layer 160 may provide a user interface (UI), and the application layer 160 may also express information in a format that can be recognized by human beings and may deliver (or transport) user input to a lower layer.

Hereinafter, the exemplary embodiment of the present invention will disclose a wireless fidelity (Wi-Fi) display (WFD) among the WI-DS in more detail.

The WFD standard was defined for performing transmission of audio/video (AV) data between devices while satisfying high quality and low latency. Wi-Fi devices may be connected to one another by using a peer-to-peer method through a WFD network (WFD session) to which the WFD standard is applied without passing through a home network, an office network, or a hotspot network. Hereinafter, a device that transmits and receives data in accordance with the WFD standard may be expressed by using the term WFD device. The WFD devices within the WRD network may discover information on the WFD devices (e.g., capability information) corresponding to one another. And, then, after configuring (or setting up) the WFD session, the WFD devices may perform rendering on the contents through the WFD session.

A WFD session may correspond to a network between a source device, which provides contents, and a sink device, which receives and performs rendering of the contents. The source device may also be expressed by using the term WFD source, and the sink device may also be expressed by using the term WFD sink. The WFD source may perform mirroring of the data existing in the display (or screen) of the WFD source to the display of the WFD sink.

The WFD source and the WFD sink may exchange a first sequence message between one another and may then perform device discovery and service discovery procedures. After completing the device discovery and service discovery procedures between the WFD source and the WFD sink, an internet protocol (IP) address may be allocated to each of the WFD source and the WFD sink. A transmission control protocol (TCP) connection may be established between the WFD source and the WFD sink, and, afterwards, real-time streaming protocol (RTSP) and real-time protocol (RTP) stacks corresponding to the WFD source and the WFD sink may be activated.

A capability negotiation procedure between the WFD source and the WFD sink may be performed through the RTSP, and, while the capability negotiation procedure is being carried out, the WFD source and the WFD sink may exchange RTSP-based messages (message (M) 1 to M4). Thereafter, the WFD source and the WFD sink may exchange WFD session control messages. Also, a data session may be established between the WFD source and the WFD sink through the RTP. In the WFD network, a User Datagram Protocol (UDP) may be used for data transport.

FIG. 2 is a conceptual view illustrating a WFD network.

Referring to FIG. 2, as WFD devices, a WFD source 200 and a WFD sink 250 may be connected to one another based on WiFi-P2P.

Herein, the WFD source 200 may refer to a device supporting streaming of multimedia contents through a WiFi peer to peer (P2P) link, and the WFD sink 250 may refer to a device performing procedures of receiving multimedia contents from the WFD source 200 through the P2P link and generating image and/or sound. The procedure of generating image and/or sound may also be expressed by using the term rendering.

The WFD sink 250 may be divided into a primary sink and a secondary sink. Most particularly, when independently connected to the WFD source 200, the secondary sink may perform rendering only on an audio payload.

FIG. 3 is a conceptual view illustrating a WFD session.

The first part starting from the top of FIG. 3 corresponds to an audio-only session. A WFD source 300 may be connected to any one of a primary sink 305 and a secondary sink 310 through the audio-only session.

The second part starting from the top of FIG. 3 corresponds to a video-only session. A WFD source 320 may be connected to a primary sink 325.

The third part starting from the top of FIG. 3 corresponds to an audio and video session, and, just as in the video-only session, a WFD source 340 may be connected to a primary sink 345.

The fourth part starting from the top of FIG. 3 discloses a session connection in a Coupled WFD Sink operation. In the Coupled Sink WFD operation, the primary sink 365 may perform video rendering, and the secondary sink 370 may perform audio rendering. Alternatively, the primary sink 365 may perform both video rendering and audio rendering.

The above-described WFD sessions may be established after performing the procedures described below in FIG. 4.

FIG. 4 is a conceptual view illustrating a WFD session configuration method.

Referring to FIG. 4, the WFD session may be set up (or configured) after performing WFD Device Discovery S401, WFD Service Discovery S402, WFD Connection Setup S403, and Capability Exchange and Negotiation S404 procedures.

More specifically, in the procedure of WFD Device Discovery S401, the WFD source may discover a peer device for WFD, i.e., WFD sink by performing the WFD Device Discovery procedure.

A beacon frame, a probe request frame, and a probe response frame, which are transmitted by the WFD source and the WFD sink for the WFD Device Discovery, may include WFD Information Element (IE). Herein, WFD IE may correspond to an information element including information related to the WFD, such as device type, device status, and so on.

The WFD source may transmit a probe request frame including a WFD IE to the WFD sink, and the WFD sink may transmit a probe response frame including a WFD IE as a response to the probe request frame. In case the WFD device is associated with an infrastructure AP and operates as a Wi-Fi P2P device, a WFD IE and a P2P information element may be included in the probe request frame. The probe response frame, which is transmitted as a response to the probe request frame, may be transmitted through the channel through which the probe request frame was received and may include both the P2P IE and the WFD IE.

Details related to the WFD Device Discovery procedure that are not mentioned in this description may follow related documents, such as ‘Wi-Fi Display Technical Specification’ and/or ‘Wi-Fi Peer-to-Peer (P2P) Technical Specification Wi-Fi Direct Service Addendum’, and this may also be applied to the descriptions that follow.

In the procedure of WFD Service Discovery S402, a discovery procedure corresponding to service capability between the WFD source and the WFD sink, which have performed WFD Device Discovery, may be performed. For example, if the WFD source transmits a service discovery request frame, which includes information on WFD capability, the WFD sink may transmit a service discovery response frame, which includes information on the WFD capability, as a response to the service discovery request frame. The WFD service discovery procedure may correspond to an optional procedure.

In order to perform the WFD service discovery procedure, the probe request frame and the probe response frame, which are used in the WFD device discovery procedure, may include information indicating whether or not the WFD device is equipped with the capability supporting the service discovery procedure.

In the procedure of WFD Connection Setup S403, the WFD device, which has performed the WFD device discovery procedure, and which has optionally performed the WFD service discovery procedure, may select a WFD device for performing WFD connection setup. After the WFD device is selected for the WFD connection setup in accordance with the policy or user input, and so on, any one of the connectivity schemes between Wi-Fi P2P and tunneled direct link service (TDLS) may be used for WFD connection. The WFD devices may determine a connection method based on an associated basic service set identifier (BSSID) subelement, which is transported along with preferred connectivity information and WFD information elements.

FIG. 5 is a conceptual view illustrating a network between a WFD source and a WFD sink.

An upper part of FIG. 5 discloses a connection between a WFD source 500 and a WFD sink 510 that is based on Wi-Fi P2P, and a lower part of FIG. 5 discloses a connection between a WFD source 550 and a WFD sink 560 that is based on a TDLS link.

As shown in the upper part of FIG. 5, the AP may be common to both the WFD source 500 and the WFD sink 510, or the AP may be different for each of the WFD source 500 and the WFD sink 510. Alternatively, the AP may not exist. As shown in the lower part of FIG. 5, in case WFD connection is performed by using a TDLS link, the WFD source 550 and the WFD sink 560 shall maintain connection with the same AP.

The WFD capability exchange and negotiation procedure may be performed after performing the WFD connection setup procedure between the WFD devices. By performing the WFD capability exchange and negotiation procedure, the WFD source and the WFD sink may exchange at least one set of information on the codec supported by each of the WFD source and the WFD sink, profile information of the codec, level information of the codec, and resolution information. The WFD capability exchange and negotiation procedure may be performed by exchanging messages using a Real Time Streaming Protocol (RTSP). Also, a parameter set defining audio/video payload during a WFD session may be determined. The WFD capability exchange and negotiation procedure may be performed by exchanging messages starting from RTSP M1 to RTSP M4, as shown in FIG. 6.

The WFD session establishment procedure may be performed after the WFD exchange and negotiation procedure.

FIG. 6 is a conceptual view illustrating WFD capability exchange and negotiation procedures.

Referring to FIG. 6, the WFD source may transmit a RTSP M1 request message for initiating a RSTP procedure and WFD capability negotiation (step S601).

The RTSP M1 request message may include a RTSP OPTIONS request for determining a RTSP method set, which is supported by the WFD sink. The WFD sink that has received the RTSP M1 request message may transmit a RTSP M1 response message including a list of RTSP methods, which are supported by the WFD sink (step S602).

Subsequently, the WFD sink may transmit a RTSP M2 request message for determining a RTSP method set, which is supported by the WFD source (step S603).

When the RTSP M2 request message is received, the WFD source may respond to the request message by transmitting a RTSP M2 response message including a list of RTSP methods, which are supported by the WFD source (step S604).

The WFD source may transmit a RTSP M3 request message (RTSP GET_PARAMETER request message) specifying a list of WFD capabilities that the WFD source wishes to know (step S605).

When the RTSP M3 request message is received, the WFD sink may respond to the request message by transmitting a RTSP M3 response message (RTSP GET_PARAMETER response message) (step S606).

Based on the RTSP M3 response message, the WFD source may determine an optimal parameter set that is to be used during a WFD session, and, then, the WFD source may transmit a RTSP M4 request message (RTSP SET_PARAMETER request message), which includes the determined parameter set, to the WFD sink.

The WFD sink that has received the RTSP M4 request message may transmit a RTSP M4 response message (RTSP SET_PARAMETER response message) (step S607).

FIG. 7 is a conceptual view illustrating a WFD session establishment procedure.

In FIG. 7, the WFD sources/WFD sinks that have performed WFD capability exchange and negotiation may establish a WFD session. More specifically, the WFD source may transmit a RTSP SET parameter request message (RTSP M5 Trigger SETUP request) to the WFD sink S701.

The WFD sink may transmit a RTSP M5 response message as a response to the RTSP SET parameter request message (step S702).

If the RTSP M5 message including the trigger parameter SETUP is successfully exchanged, the WFD sink may transmit a RTSP SETUP request message (RTSP M6 request) to the WFD source (step S703).

If the RTSP M6 request message is received, the WFD source may respond to the request message by transmitting a RTSP SETUP response message (RTSP M6 response) (step S704).

A successful establishment of a RTSP session may be indicated by setting up a status code of the RTSP M6 response message.

After a successful exchange of the RTSP M6 message, the WFD sink may transmit a RTSP M7 request message to the source device in order to notify that the WFD sink is ready to receive RTP streams (step S705), and the WFD source may respond to the request message by transmitting a RTSP PLAY response message (RTSP M7 response message) (step S706). A successful establishment of a WFD session may be indicated based on the status code of a RTSP PLAY response message.

After a WFD session is established, the WFD source may transmit a RTSP M3 request message (RTSP GET_PARAMETER request message) for acquiring capability corresponding to at least one RTSP parameter that is supported by the WFD sink, a RTSP M4 request message for setting up at least one RTSP parameter value in order to perform capability re-negotiation between the WFD source and WFD sink for updating an Audio/Video (AV) format, a RTSP M5 request message triggering the WFD sink to transmit a RTSP PAUSE request message (RTSP M9 request message), a RTSP M12 request message indicating that the WFD source has entered a WFD standby mode, a RTSP M14 request message for selecting input type, input device, and other parameters that are to be used in a user input back channel (UIBC), or a RTSP M15 request message for enabling or disabling a user input back channel (UIBC), and so on, to the WFD sink. The WFD sink that has received the above-described RTSP request messages from the WFD source may respond to the received request messages by transmitting RTSP response messages.

Subsequently, the WFD sink may transmit a RTSP M7 request message (RTSP PLAY request message for initiating (or resuming) audio/video streaming, a RTSP M9 request message (RTSP PAUSE request message) for temporarily interrupting audio/video streaming that is being transmitted from the WFD source to the WFD sink, a RTSP M10 request message for requesting the WFD source to change the audio rending device, a RTSP M11 request message indicating a change in an active connector type, a RTSP M12 request message indicating that the WFD sink has entered the WFD standby mode, a M13 request message requesting the WFD source to refresh an instantaneous decoding refresh (IDR), a RTSP M14 request message for selecting input type, input device, and other parameters that are to be used in a UIBC, or a RTSP M15 request message for enabling or disabling a UIBC, and so on, to the WFD source. The WFD source that has received the RTSP request messages, which are listed above, may respond to the received request messages by transmitting RTSP response messages.

Once the WFD session is established and audio/video streaming is initiated, the WFD source and the WFD sink may carry out audio/video streaming by using a codec that is commonly supported by both the WFD source and the WFD sink. By using the codec that is commonly supported by both the WFD source and the WFD sink, interoperability between the WFD source and the WFD sink may be ensured.

WFD communication is based on a WFD IE, and a format of the WFD IE may be defined as shown below in Table 1.

TABLE 1 Size Value Field (octets) (Hexadecimal) Description Element 1 DD IEEE 802.11 vendor specific ID usage Length 1 Variable Length of the following fields in the IE in octets. The length field is variable and set to 4 plus the total length of WFD subelements. OUI 3 50-6F-9A WFA Specific OUI OUI Type 1 0A Identifying the type or version of the WFD IE. Setting to 0x0A indicates WFA WFD v1.0 WFD Variable One or more WFD subelements subelements appear in the WFD IE

The format is configured of an element ID field, a Length field, a WFD-specific OUI field, a OUI type field indicating a type/version of the WFD IE, and a WFD subelement field. The WFD subelement field is configured to have a format that is shown below in Table 2.

TABLE 2 Size Value Field (octets) (Hexadecimal) Description Subelement 1 Identifying the type of WFD ID subelement. The specific value is defined in Table 5-3. Length 2 Variable Length of the following fields in the subelement Subelements Variable Subelement specific body field information fields

The subelement ID field may be defined as shown below in Table 3.

TABLE 3 Subelement ID(Decimal) Notes 0 WFD Device Information 1 Associated BSSID 2 WFD Audio Formats 3 WFD Video Formats 4 WFD 3D Video Formats 5 WFD Content Protection 6 Coupled Sink Information 7 WFD Extended Capability 8 Local IP Address 9 WFD Session Information 10 Alternative MAC Address 11-255 Reserved

Referring to Table 3, the 1-octet subelement ID field may indicate which type of information is included in the WFD subelement. More specifically, the values 0, 1, . . . , 10 of the subelement ID field may respectively indicate that each of the subelements corresponds to a WFD Device Information subelement, an Associated BSSID subelement, a WFD Audio Formats subelement, a WFD Video Formats subelement, a WFD 3D Video Formats subelement, a WFD Content Protection subelement, a Coupled Sink Information subelement, a WFD Extended Capability subelement, a Local IP Address subelement, a WFD Session Information subelement, and an Alternative MAC Address subelement. Herein, the WFD Device Information subelement may include information that is required for determining whether or not to attempt pairing with a WFD device and session generation. The Associated BSSID subelement may be used for indicating the address of the currently associated AP. Each of the WFD Audio Formats subelement, the WFD Video Formats subelement, and the WFD 3D Video Formats subelement may be used to respectively indicate capabilities of the WFD device related to audio, video, and 3D video. The WFD Content Protection subelement transports information related to a content protection method, and the Coupled Sink Information subelement may deliver information related to the status of the coupled sink, the MAC address, and so on. The WFD Extended Capability subelement may be used for transporting diverse capability information of other WFD devices, and the Local IP Address subelement may be used for transporting an IP address to a WFD peer during a TDLS setup procedure. The WFD Session Information subelement may include information corresponding to a list information technicians of WFD device information within a WFD group. In case the WFD connection method requires an interface (e.g., MAC address) that is different from the interface used in the device discovery procedure, the Alternative MAC Address subelement may transport the related information.

Hereinafter, the exemplary embodiment of the present invention will disclose a method for transmitting information corresponding to user input (UI) and subtitles via WFD (or miracast).

Most particularly, each of a UI image/subtitle may be packetized, so as to be generated as a separate Packetized Elementary Stream (PES). Thereafter, the generated PES may be overlaid and multiplexed (MUXed) to a video stream through a multiplexer of an MPEG2-transport stream (TS) and may then be transmitted. The exemplary embodiment of the present invention will disclose a method for configuring (or setting up) UI image/subtitles transmitted by the WFD source and optimized by the WFD sink.

According to the exemplary embodiment of the present invention, the WFD source may recognize a native resolution of the WFD sink and may then transmit a UI stream and a subtitle stream having a resolution that is optimized to the WFD sink. Based on this method, the WFD sink may output a UI image/subtitle image having an optimal size within the display of the WFD sink on the display of the WFD sink without any degradation in picture quality.

Additionally, in case the UI image and the subtitle image overlays within the video image, the exemplary embodiment of the present invention will hereinafter disclose a method performed by the WFD sink of including information related to overlay in the UIBC data (or UIBC body format) and transmitting the processed UIBC data. The information related to the overlay may include an overlay index, a stream index, and so on. Based on the information related to the overlay, which is included in the UIBC data being transmitted through the UIBC, the WFD source may accurately recognize the intentions of the user's motions (or operations) performed on the WFD sink.

The stream, which is transmitted from the WFD source and multiplexed to a single MPEG2-TS, may include a plurality of audio streams/video streams, UI streams, and subtitle streams, and the WFD sink may perform demultiplexing on the stream, which is multiplexed as a single MPEG2-TS, and, then, the WFD sink may overlay the plurality of different streams by using a method that is referred to as alpha-blending. Thereafter, the WFD sink may output the overlaid streams to the display of the WFD sink.

The MPEG2-TS is merely an example of a transport stream for performing overlay multiplexing on a plurality of streams, and a transport stream of another format (or structure) may be used instead of the PEG2-TS for multiplexing and transmitting audio streams/video streams, UI streams, subtitle streams, and so on.

FIG. 8 is a conceptual view illustrating a method for transmitting and receiving a plurality of streams based on MPEG-TS according to an exemplary embodiment of the present invention.

Referring to FIG. 8, a video stream 800, an audio stream 810, a subtitle stream 820, and a user interface stream 830 may be processed with overlay multiplexing, thereby being generated as a MPEG2-TS 840.

The video stream 800 may include video data, the audio stream 810 may include audio data, the subtitle stream 820 may include subtitle data, and the user interface stream 830 may include data corresponding to the user interface.

The video stream 800, the audio stream 810, the subtitle stream 820, and the user interface stream 830 may correspond to individual elementary streams (ESs), and each of the ESs may have a different format, a different resolution, and a different bit rate. Each of the ESs may be packetized so as to be generated as a plurality of PESs, and each of the plurality of PESs may be processed with overlay multiplexing, thereby being generated as an MPEG2-TS.

The WFD source may transmit the MPEG2-TS 840 to the WFD sink based on WiFi Direct (WiFi peer to peer (P2P)).

The WFD sink may receive the MPEG2-TS 840 based on WiFi Direct. The MPEG2-TS 840 may be demultiplexed in the WFD sink and may then be split (or separated) to a video stream 850, an audio stream 860, a subtitle stream 870, and a user interface stream 880. The separated video stream 850, audio stream 860, subtitle stream 870, and user interface stream 880 play be played (or reproduced) based on alpha-blending (or overlaying).

In what follows, Miracast R2 (Release 2) will be described.

The Miracast R2 may improve Miracast user experience by introducing a method for triggering an adequate user behavior for proper operation of a Miracast service. In this respect, the present invention may provide useful information for a Miracast R2 user to improve user experience in the Miracast R2. The useful information for the Miracast R2 user may be utilized as a good example of the WFD R2 (Release 2). Since the WFD R2 may also be expressed as WFD2, the present document uses the terms of WFD R2 and WFD2 interchangeably. Also, since embodiments of the present invention are described with respect to the Miracast R2, WFD is used in such a way to include the WFD R2. Accordingly, a WFD source is used as a concept including a WFD R2 source, and a WFD sink is used as a concept including a WFD R2 sink. Here, a WFD sink may be divided into a primary sink and a secondary sink. As a function and service specified in the WFD, metadata operation is not supported in the Miracast R1 but is supported in the Miracast R2. In other words, all of a WFD source, WFD primary sink, and WFD secondary sink support the metadata operation only in the R2 specification.

A scenario for using useful information with respect to the Miracast R2 user is as follows. The user may display a live concert from a WFD R2 source to a WFD R2 sink. The WFD R2 sink includes information about remaining battery power of the WFD R2 source. During a WFD R2 session, the WFD R2 source may deliver battery information to the WFD R2 sink. If the user pushes a certain button on a remote controller, the battery information of the WFD R2 source (for example, information about the remaining battery power) may be displayed. The user may take an action on the basis of the battery information displayed (for example, charging the battery of the WFD R2 source, stopping playing an A/V stream, changing a refresh rate, or doing no particular action).

Also, another scenario for using useful information with respect to the Miracast R2 user is as follows. The user may display a live concert from a WFD R2 source to a WFD R2 sink. The WFD R2 sink may have AV contents information played in the WFD R2 source. During a WFD R2 session, the WFD R2 source may deliver AV contents information to the WFD R2 sink. The user may want to know additional information about the AV contents played on the WFD R2 sink. If the user pushes a certain button on a remote controller, contents related to the additional information of the WFD R2 source (for example, title, type, duration, resolution, size, and protect information) may be displayed on the screen of the WFD R2 sink.

Useful information with respect to the Miracast R2 user includes battery related information, streaming contents information, wireless quality related information, and background service information.

Battery related information gives information about remaining battery power (for example, 15% or 50%) or information about the type of power supply (for example, battery mode or action mode). An expected user action with respect to the information may include charging battery of the WFD R2 source, stopping playing of an A/V stream, changing a refresh rate, or doing no particular action. A format of the battery related information with respect to the Miracast R2 user may be defined as shown in Table 4 below.

TABLE 4 wfd2-battery = “wfd2_battery:” SP remaining-battery SP power-supply-type CRLF remaining-battery = 2*2HEXDIG: remaining battery capacity in units of %. Power-supply-type = “BatteryMode” / “Charging” /“ACMode”: If the Power-supply-type field is “Charging”, the remaining-battery field is set to the current percentage of WFD R2 Source. If the Power-supply-type is “ACMode”, the remaining-battery field is set to 100%.

Streaming contents information includes title, type (video or mp4), duration, resolution, size, and protect information (about whether premium contents are being streamed). URL (Uniform Resource Locator) information (target URL) may be provided from external contents (for example, you-tube and MLB TV live) of the WFD R2 source. An expected user action with respect to the information may include simply providing useful contents or taking various actions according to the URL information of AV contents. The user may obtain other AV contents by connecting to a website (which displays an URL of the WFD R2 sink). A format of the streaming contents information with respect to the Miracast R2 user may be defined as shown in Table 5 below.

TABLE 5 wfd2-streaming-contents-info = “wfd2_streaming_contents_info:” SP title SP time SP type SP duration SP resolution SP size SP protect CRLF title = 255*255HEXDIG; Specifying the title of the streaming contents. type = 2*2HEXDIG; specifying video codec. duration = 8*8HEXDIG; specifying total duration of media in seconds resolution = 8*8HEXDIG; detailing resolution supported by the video codec size = 8*8HEXDIG; specifying total size of the media file in bytes protect = “Premium” / “NonPremium”

Wireless quality related information may include channel quality, latency information, connection type (Wi-Fi Direct, WLAN infrastructure, and so on), and AP information (SSID). How a WFD R2 source measures such information is beyond the specified scope of this document. An expected user action with respect to the information may include the user's switching the connection type according to the wireless quality information displayed with respect to the WFD R2 sink (from Wi-Fi Direct to WLAN infrastructure or vice versa). Also, if the user finds out that the current wireless quality is not in a good condition from the displayed information with respect to the WFD R2 sink, the user may relocate the position of a device (WFD R2 source, WFD R2 sink, AP) to adjust the line of sight of the device. A format of the wireless quality related information with respect to the Miracast R2 user may be defined as shown in Table 6 below.

TABLE 6 wfd2-wireless-quality-info = “wfd2_wireless_quality_info:” SP rcpi SP rsni SP jitter SP connection-type SP ssid CRLF rcpi-between-source-sink = 1*1HEXDIG; Indicates the received frame power level at the WFD R2 Source. Unit is dB. rsni-between-source-sink = 1*1HEXDIG; Indicates the received signal to noise indicator at the WFD R2 Source. Unit is dB. jitter = 1*1HEXDIG; Indicates a variation in packet transit delay. Unit is ms. Connection-type = “wifi-direct” / “wlan-infrastructure”; Indicates current connection type between WFD R2 Source and WFD R2 Sink. ssid = 16*16HEXDIG; Indicates the identity of an ESS. If the connectivity type bitmap is set to 0, SSID value is set to 0. rcpi-between-source-ap = 1*1HEXDIG; Indicates the received frame power level at the WFD R2 Source. Unit is dB. Rsni-between-source-ap = 1*1HEXDIG; Indicates the received frame power level at the WFD R2 Source. Unit is dB.

The background service information includes a list of background services (namely, all of the services operating at the WFD R2 source except for the applications of the WFD R2 source displayed on the WFD R2 sink). Through an expected user action with respect to the information, a device may know the list of all of the services operating in the WFD R2 source through the screen of the WFD R2 sink. Also, based on the information, the user may take various actions. A format of the source service information with respect to the Miracast R2 user may be defined as shown in Table 7 below.

TABLE 7 wfd2-source-service-info  = “wfd2_source_service_info:” SP  number-of- services SP service-name CRLF Number-of-services = 4*4HEXDIG; Identifying the number of services (or applications) that is running in WFD R2 source except WFD R2 source's service (or application) that is displaying in WFD R2 Sink. Service-name = 255*255HEXDIG; Identifying service title that is running in WFD R2 source except WFD R2 source's service that is displaying in WFD R2 Sink. e.g., Internet explorer, Microsoft word, WebEx, etc. The number of this field is “Number of Services * Service title field”.

The present invention may define a new RTSP message for delivering useful information with respect to the Miracast R2 user. In other words, the new RTSP message includes a format about battery related information, format about streaming contents, format about wireless quality related information, and format about background service information. A WFD R2 source may transmit an RTSP M17 SET_PARAMETER request message to a WFD R2 sink, and the WFD R2 source may receive an RTSP M17 response message (with okay) from the WFD R2 sink in response to the message. The RTSP M17 SET_PARAMETER request message transmitted by the WFD R2 source may include WFD R2 battery information, information about WFD R2 streaming contents, information about WFD R2 wireless quality, and information about a WFD R2 source background service.

Also, the present invention proposes transmitting additional information together with the information defined by the format about streaming contents information by embedding the information into the metadata rather than the RTSP message. The information defined in the format about streaming contents information and additional information may be defined as shown in Table 8 below.

TABLE 8 - Title: Specifying the title of the streaming contents. - Target-url: Specifying the target url of streaming content. - Type: specifying video codec. - Duration: specifying total duration of media in seconds - Current-playing-time: current playing time. - Remaining-duration: remaining playback time from the current playing time - Resolution: detailing resolution supported by the video codec. - Size: specifying total size of the media file in bytes - Protect: support for Protection. “Protected” / “unprotected” - Channel: information of a channel through which a stream is transmitted (e.g., 2.4GH, channel number 6) - RSSI-between source-and-sink: Indicates the Received signal strength indication at the WFD R2 Source - RSNI-between source-and-sink: Indicates the received signal to noise indicator at the WFD R2 Source - RCPI-between source-and-sink: Indicates the received channel power level at the WFD R2 Source. Unit is dB. - RSSI-between source-and-ap: Indicates the Received signal strength indication at the WFD R2 Source - RSNI-between source-and-ap: Indicates the received signal to noise indicator at the WFD R2 Source - RCPI-between source-and-ap: Indicates the received channel power level at the WFD R2 Source. Unit is dB. - Streaming-transmission-type: method for transmitting a media file “Wi-Fi Direct” or “DLNA”.

Metadata employed in this document is such kind of data for describing or defining data, which is configured with respect to the contents according to a predetermined rule. Metadata may be used for expressing the corresponding contents and finding the corresponding contents promptly.

For example, metadata may include creation date/time information of the corresponding contents for which the metadata is configured, information of date/time at which the contents are uploaded to a contents management server, information of a creator of the corresponding information (human or device information), upload subject information, information of the weather (temperature, humidity, and the like) when the corresponding contents are created, and device location information when the corresponding contents are created.

Furthermore, metadata may include a comment on the corresponding contents, information of a specific device (phone number, blog address, ID registered in a social network service, and so on), history of uses of the corresponding contents (information of devices which have used the corresponding contents), information from a web server acquired by using the corresponding contents or a specific object belonging to the contents.

Contents which may be configured by metadata according to the present invention may include various types of contents such as images (still images and videos), audio, document files, flash files, applications, web documents, and web pages. It should be noted that the metadata are not always limited to the examples above.

Furthermore, when the corresponding contents of the metadata are images, metadata may be configured not only for the image itself or a specific object included in the image but also for a specific still image among a plurality of still images included in the contents if the contents correspond to a video.

Also, if it is assumed that the corresponding contents of the metadata are images, the metadata may include a specific object itself included in the image, position information of the specific object included in the image, gender indices of people included in the image (distribution of males and females), activity indices (or distribution of motions) of objects included in the image, and indices of colors included in the image (or distribution of colors).

Also, if it is assumed that the corresponding content of the metadata is a video, the metadata may include motion of interest (MOI) of a specific object belonging to the video during a predetermined time period and mark information of a particular section (for example, a section spanning from 30 seconds to 1 minute since the start of play) within the video.

Also, metadata may include identification information of contents for which the metadata is configured, and the corresponding contents may include identification information of the metadata configured for the contents. This is so intended that metadata is used to find the contents for which the metadata are configured, and contents are used to find the metadata configured for the contents.

Also, metadata may be stored and managed separately from the contents for which the metadata are configured or stored and managed by being included in the contents for which the metadata are configured. It should be noted that metadata are not limited to the examples above but may be configured to have more various types of information with respect to the corresponding contents.

Also, the present invention proposes to include the battery information, streaming contents information, wireless quality information, and source service information in the metadata and to deliver the metadata according to the RTSP message transmission scheme. The RTSP message transmission scheme is illustrated in FIG. 9. The metadata format is based on the ITU-T Recommendation H.222.0 specification.

As described above, the WFD source, WFD primary sink, and WFD secondary sink are the functions and services specified in the WFD, and metadata operation is not supported in the R1 but supported in the R2 specification.

At this time, wfd2-metadata-capability is used to indicate whether WFD R2 metadata operation is supported. The wfd2-metadata-capability may be formatted as shown in Table 9 below.

TABLE 9 wfd2-metadata-capability = “wfd2_metadata_capability:” SP metadata-cap CRLF; metadata-cap = “none” / “supported”; “none” if not supported

The value of “supported” represents that a WFD R2 device is capable of transmitting metadata of media contents and receiving metadata from a peer WFD R2 device. If an RTSP M3 request message includes the wfd2-metadata-capability parameter, and the WFD R2 sink does not support the features of metadata, the WFD R2 sink has to respond to the parameter by sending “none” to the RTSP M3 response message. Among the parameters included in the RTSP message, the wfd2-metadata-capability is illustrated in Table 10 below.

TABLE 10 Parameter Parameter name description Method Definition Usage wfd2-metadata- Indicate the GET 6.1.21 Optional in capability support for RTSP M3 metadata request. function using Mandatory in RTSP. RTSP M3 response if RTSP M3 request includes it.

FIG. 9 is a flow diagram of including information useful for a user of the Miracast R2 into metadata and delivering the metadata through the RTSP message transmission scheme according to an embodiment of the present invention.

For example, while a WFD R2 source is streaming media contents being played therein directly to a WFD R2 sink, a Miracast user may push the button of a remote controller of the WFD R2 sink to obtain detailed information of the media contents. At this time, the WFD R2 sink may transmit the RTSP M5 message 910 to the WFD R2 source and trigger transmission of metadata including detailed information of the media contents.

More specifically, the WFD R2 sink may deliver a request for transmission of metadata including information of contents currently being streamed to the WFD R2 source, which is the corresponding peer, through the RTSP M5 message 910. If the WFD R2 source receives the RTSP M5 message 910 with respect to the triggering of metadata transmission from the WFD R2 sink, the WFD R2 source may include metadata in the RTSP M17 SET_PARAMETER request message 920 and transmit the message to the WFD R2 sink. If receiving the RTSP M17 SET_PARAMETER request message 920 including metadata from the WFD R2 source, the WFD R2 sink transmits the RSTP M17 response message 930 with “status=OK” in response to the reception.

In what follows, a method for delivering information required for performing Miracast R2 through metadata in the WFD R2 will be described.

Direct streaming mode in the WFD R2 supports inclusion of WFD R2 metadata. WFD R2 metadata is delivered through a stream multiplexed according to the MPEG2-TS. WFD R2 metadata may include information of Tables 11 and 12 below. Table 11 illustrates information about video contents for the WFD R2, and Table 12 illustrates information about audio contents for the WFD R2. A WFD R2 device may deliver WFD R2 metadata to its peer WFD R2 device.

TABLE 11 Name Description Author Identifying the author or organization name of the content Title Identifying the title of the video content Codec Identifying the video codex of the media content Duration Identifying the total duration of the video content in seconds (e.g., hh:mm:ss) Resolution Identifying the resolution of the video content Refresh rate Identifying the refresh rate of the video content (e.g., p60) Size Identifying the total size of the video contents in bytes Protection Identifying whether media content is protected or not (e.g., “Protected” or “Unprotected”) URL Identifying the target URL of the media content (e.g., http://m.mlb.com/video)

TABLE 12 Name Description Title Identifying the title of the music content Artist Identifying the artist name Album Identifying the album name Year A four-digit year Comment The comment Track The number of the track on the album Genre Index in a list of genres

Information shown in Table 11 and 12 may be formatted as shown in Table 13.

TABLE 13 metadata_record( ) { for (i=0; i < N; i++) { metadata_name_length (8 bits. Specifies length of metadata_name, in bytes.) metadata_name (variable size US-ASCII string) metadata_value_length (8 bits. Specifies length of metadata_value, in bytes.) metadata_value (variable size UTF-8 string) } }

US-ASCII is a traditional character encoding method based on the English alphabet. UTF-8 string is a variable length character encoding method for Unicode. Therefore, metadata_name based on the alphabet may be formatted by the US-ASCII string while metadata_value based on the Unicode may be formatted by the UTF-8 string.

As another embodiment, information of Table 11 may be formatted as shown in Table 14.

TABLE 14 metadata_record( ) { for (i=0; i < N; i++) { metadata_author_length (8bits) metadata_author (variable size UTF-8 string, length specified by metadata_author_length) metadata_title_length (8bits) metadata_title (variable size UTF-8 string, length specified by metadata_title_length) metadata_codec_length (8bits) metadata_codec (variable size UTF-8 string, length specified by metadata_codec_length) metadata_duration_length (8bits) metadata_duration (variable size UTF-8 string, length specified by metadata_duration_length) metadata_resolution_length (8bits) metadata_resolution (variable size UTF-8 string, length specified by metadata_resolution_length) metadata_refresh_rate_length (8bits) metadata_refresh_rate (variable size UTF-8 string, length specified by metadata_refresh_rate_length) metadata_size_length (8bits) metadata_size (variable size UTF-8 string, length specified by metadata_size_length) metadata_protection_length (8bits) metadata_protection (variable size UTF-8 string, length specified by metadata_protection_length) metadata_url_length (8bits) metadata_url (variable size UTF-8 string, length specified by metadata_url_length) } }

As another embodiment, information of Table 12 may be formatted as shown in Table 15.

TABLE 15 metadata_record( ) { for (i=0; i < N; i++) { metadata_title_length (8bits) metadata_title (variable size UTF-8 string, length specified by metadata_title_length) metadata_artist_length (8bits) metadata_artist (variable size UTF-8 string, length specified by metadata_artist_length) metadata_album_length (8bits) metadata_album (variable size UTF-8 string, length specified by metadata_album_length) metadata_year_length (8bits) metadata_year (variable size UTF-8 string, length specified by metadata_year_length) metadata_comment_length (8bits) metadata_comment (variable size UTF-8 string, length specified by metadata_comment_length) metadata_track_length (8bits) metadata_track (variable size UTF-8 string, length specified by metadata_track_length) metadata_genre_length (8bits) metadata_genre (variable size UTF-8 string, length specified by metadata_genre_length) } }

As a yet another embodiment of the present invention, information of Tables 11 and 12 may be formatted as shown in Table 16 below.

TABLE 16 Metadata_record( ) { for (i=0; i < N; i++) { metadata_name_length (8 bits) metadata_name (variable size US-ASCII string, length specified by metadata_name_length) metadata_value_length (8 bits) metadata_value (variable size UTF-8 string, length specified by metadata_value_length) } }

Referring to Table 16, when all of the information of Table 11 (where N represents the number of metadata information elements included in Table 11) is transmitted by being included in the metadata, the metadata is transmitted after being formatted in the US-ASCII string and the UTF-8 string format. As another embodiment, when only part (for example, title, duration, codec, and time) of the metadata information included in Table 11 is transmitted during transmission of a video stream, only the part of information (for example, title, duration, codec, time) may be included in the metadata record. However, when metadata information is newly added, the value of N may be changed, and transmission may be performed in a flexible manner.

Also, the WFD R2 metadata may include the information shown in Table 17 below. Table 17 shows a text format for the WFD R2 (for example, subtitle of video image).

TABLE 17 Name Description Subtitle(text format) Text format information included in the image (caption) UTF-8 string or MPEG-4 part 17 text stream or CEA 608/708

As an example, information of Table 17 may be formatted as shown in Table 18 below.

TABLE 18 metadata_record( ) { metadata_subtitle_length (8bits) metadata_subtitle (variable size UTF-8 string or MPEG-4 part17 text stream or CEA 608/708, length specified by metadata_subtitle_length) }

Also, the WFD R2 metadata may include the information of Table 19 shown below. Table 19 shows information about graphic data (for example, image or caption) for the WFD R2.

TABLE 19 Name Description Graphic or image Object image included in an image (e.g, status bar of a media player, progress bar) or Subtitle of the image format (JPEG, PNG) included in the image

As one example, information of Table 19 may be formatted as shown in Table 20 below.

TABLE 20 metadata_record( ) { JPEG_descriptor_length (8bits) JPEG_descriptor( ) { descriptor_tag descriptor_length image_width image_height color_space reserved for (i = 0; i < N; i++) { private_data_byte } } } Or metadata_record( ) { PNG_descriptor_length (8bits) PNG_descriptor( ) { descriptor_tag descriptor_length image_width image_height PNG_image_type bit_depth for (i = 0; i < N; i++) { private_data_byte } }

WFD R2 metadata is delivered in a private stream but not in a stream formatted with MAUs (Metadata Access Units). At this time, the stream_id value used for streaming is private_stream_id_1, 0xbd. stream_type is set to 0x15, which indicates delivery of the WFD R2 metadata in a PES stream.

In what follows, a metadata descriptor used for the WFD R2 and PES stream format will be described.

—Descriptor loop of PMT (Program Map Table) for Program—

To declare existence of a metadata stream, metadata_pointer_descriptor is placed in the PMT within a program_info loop for a program. The WFD R2 described in Tables 11 and 12 defines a new metadata format (metadata_application_format and metadata_format) for the delivery of WFD R2 metadata together with related information. If metadata_application_format is set to 0xFF10, and metadata_format is set to 0xC8, it indicates that the metadata format is used for delivery of the WFD R2 metadata contents. The value of metadata_application_format for “User defined” case ranges from 0x100 to 0xFFFE while the value of metadata_format for “Private use” case ranges from 0x40 to 0xFE. The present invention may define a new format for the WFD R2 from among those values (namely the value of metadata_application_format ranges from 0x100 to 0xFFFE while the value of metadata_format ranges from 0x40 to 0xFE). metadata_pointer_descriptor may include the information of Table 21 below.

Table 21 Syntax Value Metadata_pointer_descriptor( ){ descriptor_tag 37 (decimal) - Metadata_pointer_descriptor tag descriptor_length The length of the descriptor metadata_application_format 0xFF10 (for WFD R2) metadata_format 0xC8 (for WFD R2) metadata_service_id Any ID, typically 0 MPEG_carriage_flags 0 reserved 0x1f if(MPEG_carriage_flags == 0|1|2){ program_number program number of the program whose es descriptor loop contains the metadata_descriptor } }

It is necessary to declare the elementary stream delivered by metadata within the loop of the elementary stream of the program map. The elementary stream parameter may include the information shown in Table 22 below.

TABLE 22 Field Value stream_type 0x15 reserved 0x07 elementary_PID pid of the elementary stream carrying the metadata reserved 0xf ES_info_length length of the elementary stream infor descriptor loop, including the metadata_descriptor

—Descriptor Loop of the PMT for Elementary Stream—

To declare the format of the metadata stream, metadata_descriptor is placed in the PMT within es_info loop for elementary stream. metadata_descriptor may include the information of Table 23 shown below.

TABLE 23 Syntax Value Metadata_descriptor( ){ descriptor_tag 38 (decimal) - Metadata_descriptor tag descriptor_length The length of the descriptor metadata_application_format 0xFF10 (for WFD R2) metadata_format 0xC8 (for WFD R2) metadata_service_id Any ID, typically 0 decoder_config_floags 0 DSM-CC_flag 0 reserved 0xf

—PES Stream Format—

The metadata including information of Table 11 and 12 may be stored in a PES packet as a full WFD R2 frame. A full WFD R2 frame includes a full WFD R2 header. In other words, the WFD R2 frame is used for transmitting metadata including information of Table 11 and 12. The WFD R2 header may be formatted as shown in Table 24 below.

TABLE 24 wfd2_metadata_header( ) { wfd2_tag (24 bits) wfd2_data_structure_type (1 bit) reserved (6 bits) wfd2_data_update_indicator (1 bit) metadata_record_size (16 bits) }

According to Table 24, wfd2_metadata_header( ) includes wfd2_tag (24 bit), wfd2_data_structure_type (1 bit), reserved (6 bit), wfd2_data_update_indicator (1 bit), and metadata_record_size (16 bit).

wfd2_tag is a 3 byte field including a WFD-specific OUI (Organizationally Unique Identifier), and wfd2_tag may be used for identifying a WFD R2 frame delivering metadata.

wfd2_data_structure_type is a 1 bit field for identifying the metadata structure, where if wfd2_data_structure_type is 0, it indicates a video metadata structure, and if wfd2_data_structure_type is 1, it indicates an audio metadata structure.

Also, wfd2_metadata_header( ) includes reserved bits for future use, which all have to be set to zero.

wfd2_data_update_indicator is a 1 bit field indicating an update type, where, if wfd2_data_update_indicator is 0, it indicates a full update, and if wfd2_data_update_indicator is 1, it indicates an incremental update. A full update represents replacing all of previous metadata_records with new metadata_records. An incremental update represents updating previous metadata_records together with updated metadata_records. In other words, only updated metadata_records may be included in a WFD R2 frame.

metadata_record_size uses 16 bits and represents the length of the metadata_record.

Also, wfd2_metadata_header( ) may also include wfd2_header_length field representing the length of a descriptor.

Also, metadata including information of Table 17 and 19 may be stored in a PES packet as a full WFD R2 frame. A full WFD R2 frame includes a full WFD R2 header. In other words, a WFD R2 frame is intended for transmitting metadata including the information of Table 17 and 19. The WFD R2 header may be formatted as shown in Table 25 below.

TABLE 25 wfd2_metadata_header( ) { wfd2_tag wfd2_data_structure_tag Country code wfd2_subtitle_info_length }

According to Table 25, wfd2_metadata_header( ) includes wfd2_tag, wfd2_data_structure_tag, Country code, and wfd2_subtitle_info_length.

wfd2_tag is a 3 byte field including a WFD-specific OUI (Organizationally Unique Identifier), and wfd2_tag is used for identifying a WFD R2 frame delivering metadata.

Different from Table 24, wfd2_data_structure_tag is a 2 bit field for identifying a metadata structure, where, if wfd2_data_structure_tag is 0, it indicates a video metadata structure, and if wfd2_data_structure_tag is 1, it indicates an audio metadata structure. At this time, when metadata include information of Table 17, if wfd2_data_structure_tag is 2, it indicates auxiliary content metadata structure with respect to a text format (subtitle or other format). When metadata includes information of Table 19, if wfd2_data_structure_tag is 2, it indicates auxiliary content metadata structure with respect to graphic data. If wfd2_data_structure_tag is 3, it indicates reserved bits for later use.

Country code specifies the language supported in the subtitle. When the corresponding field (Country code field) is not used, it is filled with zeros.

wfd2_subtitle_infor_length specifies the length of a WFD R2 frame body. At this time, the WFD R2 frame includes subtitle metadata information.

A WFD R2 tag has to be started immediately after the PES header. A PES header has to include PTS (Presentation Time Setup). PTS has to be synchronized with an audio and a video frame (it should be on the same timeline). data_alignment bit has to be 1. The PES header has to include PES_packet_length rather than 0. The PES header may be formatted as shown in Table 26 below.

TABLE 26 PES Syntax Value PES_Packet( ){ packet_start_code_prefix 0x00 0x00 0x01 stream_id 0xbd - private_stream_id_1 PES_packet_length Length of the packet, which must not be 0 if(...){ a large test which is true in this case ‘10’ ‘10’ PES_scrambling_control 0 PES_priority 0 data_alignment_indicator 1 for the packet containing start of the WFD2 header, else 0 copyright 0 original_or_copy 0 PTS_DTS_flag if(data_alignment ==1) ‘10’ else ‘00’ ESCR_flag 0 ES_rate_flag 0 DSM_trick_mode_flag 0 additional_copy_info_flag 0 PES_CRC_flag 0 PES_extension_flag 0 PES_header_data_length the length of the data; padding may be used } }

Also, a metadata stream may be transmitted by being included in a transport stream in the same manner as an audio or a video stream. For example, only when the metadata stream is accompanied by a PES header, payload_unit_start_indicator may be set to 1 within transport_packet( ). The PES header may indicate whether WFD R2 data starts subsequently or the WFD R2 data is divided into multiple PES packets. In what follows, PES packets will be described with reference to FIGS. 10 and 11.

FIG. 10 illustrates a method for packetizing a plurality of streams based on the MPEG2-TS and transmitting the packetized streams according to an embodiment of the present invention.

FIG. 10 illustrates a transmitter which packetizes a plurality of stream according to the WFD R2 and transmitting the packetized streams. The transmitter may correspond to a WFD R2 source. The transmitter may accumulate and multiplex a video stream 1010, audio stream 1020, auxiliary stream 1030, and metadata stream 1040; and generate an MPEG2-TS 1060. The MPEG2-TS 1060 may correspond to a transport stream.

The video stream 1010 may include video data, audio stream 2010 may include audio data, auxiliary stream 1030 may include auxiliary data, and metadata stream 1040 may include metadata.

The video stream 1010, audio stream 1020, auxiliary stream 1030, and metadata stream 1040 may be individual ESs (Elementary Streams), and each of a plurality of ESs may have different formats, different resolutions, and different bit rates. Each of a plurality of ESs may be packetized through the PES packetizer 1050 and generated as an individual PES; and each of a plurality of PESs may be accumulated and multiplexed to be generated as MPEG2-TS 1060. In other words, the video stream 1010 generates a PES with respect to the video stream through the video PES packetizer 1050, the audio stream 1020 generates a PES with respect to the audio stream through the audio PES packetizer 1050, the auxiliary stream 1030 generates a PES with respect to the auxiliary stream through the auxiliary PES packetizer 1050, and metadata stream 1040 generates a PES with respect to the metadata stream through the metadata PES packetizer.

The WFD R2 source may transmit the generated MPEG2-TS 1060 to the WFD R2 sink according to the WFD R2 (or Miracast R2).

FIG. 11 illustrates a structure of a PES packet according to an embodiment of the present invention.

Referring to FIG. 11, a PES packet includes a Packet Start Code Prefix (in what follows, PSCP), Stream ID, PES Packet Length and PES Packet Data (PES payload). PSCP and Stream ID are used to determine a scramble channel PES Packet Length is used to represent the length of a PES packet. PES Packet Data are used for storing data of image information. PSCP, Stream ID, and PES Packet Length may be called a PES header.

FIG. 12 is a flow diagram illustrating a procedure of transmitting metadata in the WFD R2 according to an embodiment of the present invention.

First, to clarify terms, since WFD R2 may also be expressed as WFD2, this document uses the terms of WFD R2 and WFD2 interchangeably. Also, since embodiments according to the present invention are described based on the Miracast R2, WFD is used as a concept including the WFD R2. Accordingly, a WFD source is employed as a concept including a WFD R2 source while a WFD sink is employed as a concept including a WFD R2 sink.

In the S1210 step, a WFD source constitutes a WFD frame which includes a WFD header and metadata.

In the S1220 step, A WFD source transmits a packet stream, which is a stream created by packetizing a metadata stream including the WFD frame, to a WFD sink. A packet stream may correspond to a PES or PES packets.

The WFD header includes a metadata structure field and a metadata update indicator. The metadata structure field may correspond to wfd2_data_structure_type of Table 24, and the metadata update indicator may correspond to wfd2_data_update_indicator of Table 24.

The metadata structure field indicates whether metadata is configured by a video metadata structure or an audio metadata structure. In other words, metadata structure may be distinguished by the wfd2_data_structure_type value of Table 24.

The metadata update indicator indicates whether a first update or a second update is performed on the metadata. In other words, an update method may be chosen according to the value set to the wfd2_data_structure_type of Table 24. The first update may correspond to a full update in which all of the previous metadata is replaced with new metadata. The second update may correspond to an incremental update in which updated metadata is used to update previous metadata incrementally. In other words, different from the first update, if the second update is performed, only the updated metadata may be included in the WFD frame. At this time, metadata may correspond to metadata record.

Also, a metadata stream is determined by a metadata descriptor. A metadata descriptor includes a metadata application format configured by a predetermined first value and a metadata format configured by a predetermined second value. According to an embodiment of the present invention, the value of the metadata application format may be set to 0xFF10 corresponding to the predetermined first value while the value of the metadata format may be set to 0xC8 corresponding to the predetermined second value.

Also, metadata includes information about video contents and information about audio contents. Information about video contents may correspond to Table 11 while information about audio contents may correspond to Table 12. Information about video contents may include protection information indicating whether video contents are protected and URL information of the video contents. The protection information may correspond to Protection of Table 11, and the URL information may correspond to URL of Table 11.

Also, the information about video contents and the information about audio contents may be formatted by the US-ASCII string and UTF-8 string. The US-ASCII string and UTF-8 string may correspond to the scheme of Table 13.

Also, metadata may include information about a text format and/or information about graphic data. Information about the text format may correspond to Table 17, and information about graphic data may correspond to Table 19. If metadata includes information about a text format and/or information about graphic data, the WFD R2 header of the WFD R2 frame, which is a frame for transmitting the metadata, may be formatted as shown in Table 25.

Also, a metadata stream may be multiplexed with an audio stream or a video stream and transmitted as a transport stream. In the WFD R2, an auxiliary stream and/or metadata stream may be may be multiplexed together with an audio stream and/or video stream in the MPEG2-TS and transmitted as a transport stream. At this time, each stream may be packetized through a PES packetizer and generated for each of a plurality of PES. In other words, a video stream generates a PES with respect to a video stream through a video PES packetizer, an audio stream generates a PES with respect to an audio stream through an audio PES packetizer, an auxiliary stream generates a PES with respect to an auxiliary stream through an auxiliary PES packetizer, and a metadata stream generates a PES with respect to a metadata stream through a metadata PES packetizer.

Through the method described above, a WFD source may transmit metadata to a WFD sink.

FIG. 13 is a block view illustrating a wireless device to which the exemplary embodiment of the present invention can be applied.

Referring to FIG. 13, the wireless device may correspond to a WFD source 1300 and a WFD sink 1350 that can embody the above-described exemplary embodiment of the present invention.

The WFD source 1300 includes a processor 1310, a memory 1320, and a radio frequency (RF) unit 1330.

The RF unit 1330 is connected to the processor 1310, thereby being capable of transmitting and/or receiving radio signals.

The processor 1310 implements the functions, processes, and/or methods proposed in the present invention. For example, the processor 1310 may be implemented to perform the operations of the WFD source 1300 according to the above-described exemplary embodiments of the present invention. The processor may perform the operations of the WFD source 1300, which are disclosed in the exemplary embodiments of FIG. 1 to FIG. 12.

The WFD sink 1350 includes a processor 1360, a memory 1370, and a radio frequency (RF) unit 1380.

The RF unit 1380 is connected to the processor 1360, thereby being capable of transmitting and/or receiving radio signals.

The processor 1360 implements the functions, processes, and/or methods proposed in the present invention. For example, the processor 1360 may be implemented to perform the operations of the WFD sink 1350 according to the above-described exemplary embodiments of the present invention. The processor may perform the operations of the WFD sink 1350, which are disclosed in the exemplary embodiments of FIG. 1 to FIG. 12.

For example, if the battery level indicated by battery status information included in the RTSP message received from a different WFD device is lower than a predetermined threshold value, the processor 1360 may be configured to transmit an RTSP message which commands the different WFD device to operate in the power save mode.

The processor 1310 and 1360 may include an application-specific integrated circuit (ASIC), another chip set, a logical circuit, a data processing device, and/or a converter converting a baseband signal and a radio signal to and from one another. The memory 1320 and 1370 may include a read-only memory (ROM), a random access memory (RAM), a flash memory, a memory card, a storage medium, and/or another storage device. The RF unit 1330 and 1380 may include one or more antennas transmitting and/or receiving radio signals.

When the exemplary embodiment is implemented as software, the above-described method may be implemented as a module (process, function, and so on) performing the above-described functions. The module may be stored in the memory 1320 and 1370 and may be executed by the processor 1310 and 1360. The memory 1320 and 1370 may be located inside or outside of the processor 1310 and 1360 and may be connected to the processor 1310 and 1360 through a diversity of well-known means. 

What is claimed is:
 1. A method for transmitting metadata in WFD (WiFi Display), the method comprising: configuring, by a WFD source, a WFD frame comprising a WFD header and metadata; and transmitting, by the WFD source, a transport stream including the WFD frame, to a WFD sink, wherein the transport stream is a stream in which first to fourth packet streams are multiplexed by a Moving Picture Experts Group2-Transport Stream (MPEG2-TS), wherein the first packet stream is a stream in which a video stream is packetized by a first Packetized Elementary Stream (PES) packetizer, wherein the second packet stream is a stream in which an audio stream is packetized by a second PES packetizer, wherein the third packet stream is a stream in which an auxiliary stream is packetized by a third PES packetizer, wherein the fourth packet stream is a stream in which a metadata stream is packetized by a fourth PES packetizer, wherein the WFD header comprises a metadata structure field and a metadata update indicator, wherein the metadata structure field indicates whether the metadata holds a video metadata structure or an audio metadata structure, wherein the metadata update indicator indicates whether a first type update or a second type update is performed on the metadata, wherein the first type update is performed to replace all of the previous metadata with new metadata, and wherein the second type update is performed to update previous metadata incrementally by using updated metadata.
 2. The method of claim 1, wherein the metadata stream is determined by a metadata descriptor, and the metadata descriptor comprises a metadata application format configured by a predetermined first value and a metadata format configured by a predetermined second value.
 3. The method of claim 1, wherein the metadata comprises information about video content and information about audio content.
 4. The method of claim 3, wherein the information about the video content comprises protection information indicating whether the video content is protected and URL information of the video content.
 5. The method of claim 3, wherein the information about the video content and the information about the audio content are formatted by a US-ASCII string and a UTF-8 string.
 6. A WFD source configured to transmit metadata in WFD (WiFi display), the WFD source comprising: a transmitter; and a processor operatively coupled to the transmitter, wherein the processor is configured to: configure a WFD frame comprising a WFD header and metadata, and transmit, via the transmitter, a transport stream including the WFD frame, to a WFD sink, wherein the transport stream is a stream in which first to fourth packet streams are multiplexed by a Moving Picture Experts Group2-Transport Stream (MPEG2-TS), wherein the first packet stream is a stream in which a video stream is packetized by a first Packetized Elementary Stream (PES) packetizer, wherein the second packet stream is a stream in which an audio stream is packetized by a second PES packetizer, wherein the third packet stream is a stream in which an auxiliary stream is packetized by a third PES packetizer, wherein the fourth packet stream is a stream in which a metadata stream is packetized by a fourth PES packetizer, wherein the WFD header comprises a metadata structure field and a metadata update indicator, wherein the metadata structure field indicates whether the metadata holds a video metadata structure or an audio metadata structure, wherein the metadata update indicator indicates whether a first type update or a second type update is performed on the metadata, wherein the first type update is performed to replace all of the previous metadata with new metadata, and wherein the second type update is performed to update previous metadata incrementally by using updated metadata.
 7. The WFD source of claim 6, wherein the metadata stream is determined by a metadata descriptor and the metadata descriptor comprises a metadata application format configured by a predetermined first value and a metadata format configured by a predetermined second value.
 8. The WFD source of claim 7, wherein the metadata comprises information about video content and information about audio content.
 9. The WFD source of claim 7, wherein the information about the video content comprises protection information indicating whether the video content is protected and URL information of the video content.
 10. The WFD source of claim 7, wherein the information about the video content and the information about the audio content are formatted by a US-ASCII string and a UTF-8 string. 